Voiced/unvoiced speech discrimination in noise using Gabor atomic decomposition

Authors

  • Arthur P. Lobo
  • Philipos C. Loizou
Abstract

A new algorithm is developed for voiced/unvoiced speech discrimination in noise. Short segments of speech are modeled as a sum of basis functions from a Gabor dictionary. In each iteration, the matching pursuit algorithm fits a Gabor atom to the current residual, which is obtained by subtracting the previous best-fit Gabor atom from the previous residual. Multiple discriminant analysis is used to reduce the dimensionality of the vector of Gabor coefficients, giving a low-dimensional feature vector for classification. A radial basis function neural network is trained on the reduced feature vector set to discriminate between voiced and unvoiced speech/silence segments. On a database of 62 sentences in 5-dB SNR speech-shaped noise, 84% correct classification accuracy was obtained.
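The matching pursuit step described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not give the dictionary parameters, so the atom form (a Gaussian window times a cosine), the grid of scales, centers, frequencies and phases, and the names `gabor_atom`, `build_dictionary` and `matching_pursuit` are all assumptions chosen for illustration. The discriminant analysis and neural network stages are not shown.

```python
import numpy as np


def gabor_atom(n, center, freq, scale, phase=0.0):
    """Unit-norm real Gabor atom: Gaussian window times a cosine (assumed form)."""
    t = np.arange(n)
    window = np.exp(-np.pi * ((t - center) / scale) ** 2)
    atom = window * np.cos(2 * np.pi * freq * (t - center) + phase)
    norm = np.linalg.norm(atom)
    return atom / norm if norm > 0 else atom


def build_dictionary(n, scales=(8, 32), hop=16, n_freqs=8):
    """Hypothetical dictionary grid; the paper's actual grid is not stated."""
    atoms = []
    for scale in scales:
        for center in range(0, n, hop):
            for k in range(1, n_freqs):
                freq = 0.5 * k / n_freqs  # normalized frequency in (0, 0.5)
                for phase in (0.0, np.pi / 2):
                    atoms.append(gabor_atom(n, center, freq, scale, phase))
    return np.array(atoms)


def matching_pursuit(x, dictionary, n_iter):
    """Greedy decomposition: pick the best-correlated atom, subtract it, repeat."""
    residual = np.asarray(x, dtype=float).copy()
    coeffs = np.zeros(len(dictionary))
    for _ in range(n_iter):
        corr = dictionary @ residual          # inner product with every atom
        k = int(np.argmax(np.abs(corr)))      # index of the best-fit atom
        coeffs[k] += corr[k]
        residual -= corr[k] * dictionary[k]   # new residual for the next iteration
    return coeffs, residual


# Toy usage on a synthetic 128-sample segment (not real speech).
rng = np.random.default_rng(0)
D = build_dictionary(128)
segment = 2.0 * D[40] + 0.1 * rng.standard_normal(128)
coeffs, residual = matching_pursuit(segment, D, n_iter=10)
```

In this sketch the vector `coeffs` plays the role of the Gabor coefficient vector that the abstract says is passed on to dimensionality reduction. Because the atoms have unit norm, each iteration cannot increase the residual energy.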


Similar resources

Robust voiced/unvoiced speech classification using empirical mode decomposition and periodic correlation model

This paper presents a method of voiced/unvoiced (V/Uv) classification of noisy speech signals. Empirical mode decomposition (EMD), a newly developed tool to analyze nonlinear and non-stationary signals, is used to filter the additive noise with the speech signal. The normalized autocorrelation of the filtered speech signal is computed to enhance the periodicity, if any. It is considered that the ...


A Comprehensive Noise Robust Speech Parameterization Algorithm Using Wavelet Packet Decomposition-Based Denoising and Speech Feature Representation Techniques

This paper concerns the problem of automatic speech recognition in noise-intense and adverse environments. The main goal of the proposed work is the definition, implementation, and evaluation of a novel noise robust speech signal parameterization algorithm. The proposed procedure is based on time-frequency speech signal representation using wavelet packet decomposition. A new modified soft thre...


New Adaptive Speech Enhancement System Using a Novel Wavelet Thresholding Technique

A new adaptive speech enhancement system, which utilizes a second-generation wavelet transform (SGWT) decomposition and a novel adaptive subband thresholding technique, is presented. The adaptive thresholding technique is based on accurate estimation of subband segmental signal-to-noise ratio (SegSNR) and voiced/unvoiced classification of the speech. First, the speech signal is segmented and ea...


Automatic detection of Parkinson's disease from continuous speech recorded in non-controlled noise conditions

Automatic classification of Parkinson’s disease (PD) speakers and healthy controls (HC) is performed considering speech recordings collected in non-controlled noise conditions. The speech tasks include six sentences and a read text. The recording is performed using an open source portable device and a commercial microphone. A speech enhancement (SE) technique is applied to improve the quality o...


Denoising Of Speech Signal By Classification Into Voiced, Unvoiced And Silence Region

In this paper, a speech enhancement method based on the classification of voiced, unvoiced and silence regions and using the stationary wavelet transform is presented. To prevent degradation of speech quality during the denoising process, speech is first classified into voiced, unvoiced and silence regions. An experimentally verified criterion based on the short time energy process has been ...




Publication date: 2003